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APPLICANT: Parrott, Wayne 
LaFayette, Peter 
Kane, Patrick 

TITLE OF INVENTION: Arabitol or Ribitol As Positive Selectable Markers 
FILE REFERENCE: UGA-8 55R 

CURRENT APPLICATION NUMBER: US/09/802 , 208B 
CURRENT FILING DATE: 2001-03-08 

NUMBER OF SEQ ID NOS : 5 
SOFTWARE: Patentin version 3.0 
SEQ ID NO: 1 
LENGTH: 1848 
TYPE: DNA 

ORGANISM: Escherichia coli 
SEQUENCE : 1 

atgaacgaac aatttacatg gctgcacatc gggttaggtt cttttcatcg cgcacatcag 60 

gcgtggtatc tacaccgttt gcaggtgatg ggcgataaac gctggagcat tgctgcgggc 120 

aatattcgta atgatgctga acatgtcgta caggcactca gtgcacagaa aggtcgctat 180 

gtgctggaaa ccgtcagccc ggaaggggta agcgaatatg aagagatcac ctcaattcag 24 0 

aagttgatac cgtggcaggc agatttacaa ccgctgattg ctgaaggggc agatccgaag 300 

acaaaagtga ttgctttcac cgtcaccgaa ggcgggtact acctgaatac cagtcacaaa 360 

ctggaagtta acaatcctga tttagcggca gatcttaaag ggggatgcaa aacaatttac 4 20 

ggtgttatta cccgtatcct cgaagcgcgt atggcaaata acgccggacc actaaccctg 480 

atgaacgaac aatttacatg gctgcacatc gggttaggtt cttttcatcg cgcacatcag 54 0 

gcgtggtatc tacaccgttt gcaggtgatg ggcgataaac gctggagcat tgctgcgggc 600 

aatattcgta atgatgctga acatgtcgta caggcactca gtgcacagaa aggtcgctat 660 

gtgctggaaa ccgtcagccc ggaaggggta agcgaatatg aagagatcac ctcaattcag 720 

aagttgatac cgtggcaggc agatttacaa ccgctgattg ctgaaggggc agatccgaag 780 

acaaaagtga ttgctttcac cgtcaccgaa ggcgggtact acctgaatac cagtcacaaa 84 0 

ctggaagtta acaatcctga tttagcggca gatcttaaag ggggatgcaa aacaatttac 900 

ggtgttatta cccgtatcct cgaagcgcgt atggcaaata acgccggacc actaaccctg 960 

ctgaattgcg ataacgtgcg ccataatggt gaacgtttcc atgatggcct ggttgagttt 1020 

ctccagctaa ctggcaaaca ggatgtcatc gactggctga gtacaaatac cacttgcccg 1080 

aataccatgg ttgaccgcat tacgcctcgt ccggcagcag aacttccggc acggatcaag 114 0 

gctcaaacgg gtattgccga taaagcgccg gtaatgggcg aaacctttat ccagtgggtc 1200 

gtggaagata atttccgtga tgtccgtccg gcactggaga aggtcggtgt cgaactggtg 1260 

gcgtcggtaa tcccctatga agaggcgaag attcgcattc ttaactcttc acacagttgc 1320 

atcgcctggg caggtacgtt aatcggtcaa aaatatatcc acgaaagcac aatgaccgat 1380 

tttatctatc agattgccga ccgctacgtg acggaagatg tcattccttg cttgggcgat 1440 

aacggtatcg atttgccaac ctaccgtgat gttgtactca agcgttttac caatccacat 1500 

attcaggaca ccaaccaacg cgtcgctgcg gatggtttct cgaaaattcc ggcgatgatt 156 0 

gcccccacac tgcgagagtg ctaccagcga ggcgttcgcc cgaatgccac cgccatgtta 1620 

cctgcactgt tttacgtatt catggagcag tggcatcacg gcaaactgcc ctatgaatat 1680 

caggatggca tccttgatgc accagctgtc catgcaatgt tacagtctgc cgatcccgtc 1740 
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45 gctgtttatg ccagtgataa agcgctgttt ggcgatttaa ccgaacgtga agattttgcc 1800 

46 gcgttgttgc gcgaaaaaat cgctgacgtc tacgcgttaa ttaactaa 1848 
4 8 <210> SEQ ID NO: 2 

49 <211> LENGTH: 3971 

50 <212> TYPE: DNA 

51 <213> ORGANISM: Escherichia coll 

52 <220> FEATURE: 

53 <221> NAME/KEY: misc_feature 

54 <222> LOCATION: (96).. (848) 

55 <223> OTHER INFORMATION: ribitol dehydrogenase coding region 

56 <220> FEATURE: 

57 <221> NAME/KEY: misc_feature 

58 <222> LOCATION: ( 859 ).,( 2463 ) 

59 <223> OTHER INFORMATION: ribitol kinase coding region 

60 <220> FEATURE: 

61 <221> NAME/KEY: misc_feature 

62 <222> LOCATION: ( 256 5 ) . . ( 3839 ) 

6 3 <223> OTHER INFORMATION: ribitol transporter coding region 

64 <4 00> SEQUENCE: 2 

65 atcgattgag cagtttgctt cacacggcag ctaaattccc gttcagtgcg tgcaagcaaa 60 

66 cagagactat aaattcgccc tggtaaaagg attatatgat gaatcactct gtgccctcta 120 

67 tgaatactcc ccttaatggc aaagttgcag ctatcactgg cgctgcgtca ggtattggcc 180 

68 tgcaatgtgc aaaaacgctg ctcgatgcag gagcaaaggt agtattgatt gaccgggaag 240 

69 gcgacaaact gcacaagatt gtcgctgagt taggcgaaaa cgcgtacgcg ctgcaactcg 300 

70 atctcttcaa taatcagcaa gtcgataaca tgctggcgga cattatcgaa ctggcgggtg 360 

71 ggctggatat ttttcatgcc aatgcaggcg cttatattgg cggcccagtg gctgaaggtg 4 20 

72 atccagatgt ctgggatcgt gtgttaaatc tgaatataaa tgcggcgttt cgctgtgtcc 480 

7 3 gtgcagtcct gccgcatatg attgcgcaga ggtcgggcga tataattttt accagttcca 54 0 
74 tcgcgggcgt cgttccggtt atctgggaac cgatctacac cgcgtccaaa tttgccgttc 600 
75. aggcattcgt acacactacc cgccgccagg tttctcaata tggcgtgcgt gtgggtgcgg 660 

76 tgctgccagg accagtagtc actgccctgc ttgatgactg gccaaaagcc aaaatggaag 720 

77 aagccctggc aaatggtagc ctgatgcaac cgattgaagt ggcggaatca gtattgttta 780 

78 tggtgacccg ctcgaaaaat gtcaccgtgc gagatttagt gatcctgcct ggcagtgtcg 84 0 

79 atctgtaagg gcgcaatcat gacaataacc aaaaccgtta ttggtgttga tgtgggatca 900 

80 ggcagtgtcc gcgccgggat ttttgatctc aacggatctc tgctatccca tgccacagaa 960 

81 aaaatcacga ctacgcggcg cagcggaagc cgcgtggaac agtccagcca ggagatctgg 1020 

82 caggcggtct gttcatgtat tcgtaatgcg ctcactctgg cagacgtttg tgcacaaagt 1080 

83 gtggcaggca tcggttttga tgccacctgt tctctggtgg tactggataa aaacggtgat 1140 

84 ccattgcctg tcagcccgga aggagatgca aagcaaaata tcattgtgtg gatggatcac 1200 

85 cgcgccaccg aacaagcaga gcgaatcaat gccactcacc atccggtgct gaactacgtc 1260 

86 ggtggtaaaa tttcgcctga aatggaaaca ccgaaaattc tctggctgaa agaaaatatg 1320 

87 ccagagatct acgaacgtgc cggacaattt ttcgatctgg ccgattttct gacctggcgg 1380 

88 gctaccggtg atttagcgcg ttcagtatgc actgttacct gtaaatggac gtggctggca 1440 

89 catgaaaatc gctgggatcc agattatttc cgcaccattg gccttgcaga gttagcggat 1500 

90 gaagatttta ttcgcattgg tcatcatatt gtttctcccg gaacaccttg tggaaatggt 1560 

91 ttaacagcac aagccgcggc agagatggga ttactccccg gcacacctgt cgctgtaggg 1620 

92 ttaattgatg ctcacgctgg tggcatcggt acggttggcg tagaaggtgg agcgctgaac 1680 

93 aatctcgcgt atgttttcgg cacttcttca tgcaccatgg catctaccac ttctccctcg 1740 

94 tttgtaccgg gtgtctgggg gccgtattac agtgcgatgg ttccaggtct gtggttagtt 1800 
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95 gaaggcgggc aaagtgctgc gggagcagct attgaccagc tacttgattt ccatccggct 1860 

96 gttgaagaag ctcgcgaaat ggcacaacgt gtgaatcagc ccctccccgt ctggcttgct 1920 

97 gatcgaatcc tcgaaaaaac ggcgcaacca tcagatgctg tcgccctggc gaaagggcta 1980 

98 cacgtggtgc cggaatttct cggaaatcgc gcccccttcg cagatcctca tgccagagcg 2040 

99 gtaatttgtg gcctgggtat ggagcgagat ctggataatt tactcgcctt gtatatcgct 2100 

100 ggattatgcg gaattggtta tggtctgcgc caaattctcg acgctcaaac agcgcaggga 2160 

101 gtagtgagta aaaatatcgt tattagcggc ggtgccgggc agcatccact ggtacgacaa 2220 

102 attctggcag atacctgcgg tattccggtc attaccacgc aatgctgcga accggtttta 2280 

103 ttaggctcgg ccattcttgg tgctgtcgcc ggaaatattg caccttctgt tggcgaagcg 2340 

104 atgcaacaat tcacccatgt ggataaatat tattatccgc aagaacgcta tcaatctctt 2400 

105 catcatcgtc gatatgaggc ctataagcag ttgcagcata ctgcaaaatt actcagagac 2460 

106 taattaacca gccgcctgac gctgttttca ggcaatcact aataacgact cactccggta 2520 

107 atatcccgga gtgcattcat ctgcacccta aaaacgaggt ttatatgtcc agaaataata 2580 

108 aacagtggtt gggtttgcca ctgcatctga tatggggata tatcgccatc gcagtattta 2640 

109 tgactggtga tggtttcgaa ctcgcattct tatcgcacta tattaaatcg ttaggcttca 2700 

110 cacctgcgga agcctctttt gcctttacgc tctacggcct ggcggctgcc ctttccgcct 2760 

111 gggtttctgg ggtagtagcg gaaatcatca cgccgcaaaa aaccatgctg attggttttg 2820 

112 tcctatggtg cgtattccat gttctgtttc tggtctttgg attaggacag gcaaactatg 2880 

113 ggttaatcct gctgttttac gggattcgtg gtttagcgta tccgctattt ctctactcat 2940 

114 ttatcgtcgt tattattcat aacgtgcgta gcgaaaattc cagttctgct ctgggttggt 3000 

115 actgggcggt atattcagta gggatcggtg ttgctggcag ttatattccc agttttacga 3060 

116 taccgattat gggtgaaatg ggaaccttat ggctggcact ggcgttctgc tttgctggcg 3120 

117 gtgtcattgc catgatctcc ttgcgtcatg ttaaaacgcc tggacatatg cataatttaa 3180 

118 ctccccgtga gaagtttgca gaattaagtc gggcagtaac tttactttat accaaccgca 3240 

119 atatttttct ctccagtatt gtgcgcatta taaatacctt atcgttattc ggttttgcgg 3300 

120 tcattatgcc aatgatgttt gtggatgaac tgggattcac cacctctgaa tggttgcagg 3360 

121 tctgggcggc atttttcttt accactattt tctctaatat tttttggggg attgtggcag 3420 

122 aaaaaatggg ctggatgcgt gttattcgct ggtttggttg cctcgggatg gcagcatcaa 34 80 

123 gtttagcgtt ttactacatg ccgcaatact ttggtcacaa ctactggatg gcaatgattc 354 0 

124 cggcgattgc tctgggaact tttgttgctg catttgtgcc gatggccgct gtcttcccgg 3600 

125 cactggaacc aaaacacaaa ggtgctgcaa tctcggttta caacctctct gcgggtatgt 3660 

126 ctaacttcct ggctccggca attgccgtgg tgttattacc gtggtttagc actatcggtg 3720 

127 tggtcattgc ctatacagca ttgtatctat tggcctttgt cctttgcgca ttcattcgcg 3780 

128 ttgagcagcc aggattcagt tctgcgccag tgactgagaa ggcattgaat atctcctgaa 3840 

129 aaacgaaacg catcaggcac tcatcctctc cctcatggga gaggatgatt tcacatcagg 3900 

130 caatagtgac tttgttatcg agataaacgt cctgcacggc gttaatcagt ttcacgccgt 3960 

131 cagccatcga t 3971 

133 <210> SEQ ID NO: 3 

134 <211> LENGTH: 250 

135 <212> TYPE: PRT 

136 <213> ORGANISM: Escherichia coli 

137 <4 00> SEQUENCE: 3 

138 Met Met Asn His Ser Val Pro Ser Met Asn Thr Pro Leu Asn Gly Lys 

139 15 10 15 

140 Val Ala Ala He Thr Gly Ala Ala Ser Gly He Gly Leu Gin Cys Ala 

141 20 25 30 

142 Lys Thr Leu Leu Asp Ala Gly Ala Lys Val Val Leu He Asp Arg Glu 

143 35 40 45 

144 Gly Asp Lys Leu His Lys He Val Ala Glu Leu Gly Glu Asn Ala Tyr 
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